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ABSTRACT 


User clustering algorithms have been introduced to analyze 
users’ learning behaviors and help to provide personalized 
learning guides in traditional Web-based learning system- 
s. However, the explicit and implicit coupled interactions, 
which means the correlations between user attributes gener- 
ated from learning actions, are not considered in these algo- 
rithms. Much significant and useful information which can 
positively affect clustering accuracy is neglected. To solve 
the above issue, we proposed a coupled user clustering algo- 
rithm for Wed-based learning systems. It respectively takes 
into account intra-coupled and inter-coupled relationships 
of learning data, and utilizes Taylor-like expansion to repre- 
sent their integrated coupling correlations. The experiment 
result demonstrates the outperformance of the algorithm in 
terms of efficiently capturing correlations of learning data 
and improving clustering accuracy. 


Keywords 
Web-based learning, coupled interactions, user clustering, 
user behavior analysis 


1. INTRODUCTION 


Information technology and its application have brought 
great changes to all aspects of human, especially education 
area. Web-based learning is a significant and advanced way 
of education, meaning to utilize computer network technolo- 
gy, digital multimedia technology, database technology and 
other modern information technology to learn in digital en- 
vironment. Compared with traditional learning, Web-based 
learning can efficiently meet learners’ needs of learning any- 
time and anywhere. Meanwhile, it takes advantage of vari- 
ous online resources and helps learners to expand their hori- 
zons and discover interests. 
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Recently Web-based learning systems are studied by many e- 
ducation institutions and researchers, and a large number of 
online learning communities and virtual schools arise [1]. As 
an emerging online learning system, MOOC (Massive Open 
Online Courses) was initiated by America’s top universities 
in 2012. It had a participation of more than 6 million of stu- 
dents from around 220 countries within one year [2]. Some 
of Web-based learning systems apply user clustering algo- 
rithms to analyze learning behaviors and provide personal- 
ized learning services. Fu and Ofoghlu put forward a new 
clustering algorithm; it can extract clusters which can be 
described by overlapping layered concept in dense space [3]. 
According to the feedback of basic clustering method, Mon- 
tazer et al. proposed a hybrid clustering algorithm, which 
considered clustering issues from different perspectives, and 
kept the simplicity of basic clustering algorithm [4]. An- 
other matrix-based improved clustering algorithm was put 
forward by Zhang et al., and it is much more efficient when 
comparing with K-means [5]. Lin et al. came up with a kind 
of intuitionistic fuzzy kernel clustering algorithm (KIFCM), 
combining intuitionistic fuzzy sets and fuzzy kernel cluster- 
ing algorithm, and applied it in learner behavior analysis 
[6]. 


With the above algorithms utilized in Wed-based learning 
systems, learners’ attribute information is extracted by an- 
alyzing their behaviors, and finally used for user clustering. 
However, these algorithms generally neglect the explicit and 
implicit coupling relationships of user attributes and this 
may lead to massive significant information loss. For ex- 
ample, table 1 presents an evaluation index system based 
on information provided by a specified Web-based learning 
system. With common sense, we think that user attribute 
of “Average correct rate of homework” has a positive impact 
on “Comprehensive test result”. Generally, if the “Average 
correct rate of homework” is better, the “Comprehensive test 
result” is better. Students who behave this way are catego- 
rized in “normal” group. However, there are also students 
who can either get a better "Average correct rate of home- 
work” with a worse “Comprehensive test result”, or a better 
“Comprehensive test result” with a worse “Average correc- 
t rate of homework”; they are categorized in “unnormal” 
group. These unnormal situations are caused by irregular 
correlations of user attributes, but they are often ignored. 
This will certainly have negative effect on user clustering 
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Table 1: Comprehensive evaluation index system 
First- 
level Second-level index 
index 


Times of doing homework 
Average correct rate of homework 
Number of learning resources 
Total time length of Iearning resources 
Times of daily quiz 
Daily average quiz result 
Comprehensive test result 
Number of collected resources 
Times of downloaded resources 
Times of making notes 
Times of asking questions 
Times of marking and remarking 
Times of answering classmates’ questions 
Times of posting comments on the BBS 
Times of interaction by BBS message 
Times of sharing resources 
Average marks made by the teacher 
Average marks made by other students 
Times of marking and remarking made by 
the student for the teacher 
Times of marking and remarking made by 


the student for other students 


Autonomic 
learning 


Interactive 
learning 


accuracy. 


Nowadays an increasing number of researchers are study- 
ing the interactions between object attributes with special 
attention and have been aware that the independence as- 
sumption on attributes often leads to a mass of informa- 
tion loss. In addition to the basic Pearson’s correlation [7], 
Wang et al. put forward the intra-coupled and inter-coupled 
interactions of continuous attributes [8]. An innovative cou- 
pled group-based matrix factorization model for discrete at- 
tributes of recommender system was addressed by Li et al. 
[9]. Jakulin and Bratko proposed an algorithm to detect 
interactions between attributes, but it is only applicable in 
supervised learning with the experimental results [10]. For 
unsupervised learning, the coupled nominal similarity to ex- 
tract new relationships between entities was addressed by 
Wang et al., but it is only for categorical data [11]. We 
rarely find any methods applied in Web-based learning sys- 
tems, that consider coupling relationships of user attributes 
in user clustering. 


This paper proposed a coupled user clustering algorithm for 
Web-based learning systems, namely CUCA. It studies the 
coupling relationships of user attributes. With the help of 
Taylor-like expansion, we use a spectral clustering algorithm 
to cluster users. When it is applied in Web-based learning 
systems, it can efficiently capture learners’ behavior features 
and analyze the information behind them, especially that of 
“unnormal” group of learners, and finally use them to provide 
personalized learning services. To verify the outperformance 
of CUCA, we compare its clustering result with that of 3 
other algorithms, respectively from 3 dimensions of learning 
attitude, learning effect and the integrated dimension. 


The rest of the paper is organized as following. The clus- 
tering algorithm model is proposed in section 2. Section 
3 introduces the formalization and exemplification of the 
clustering algorithm. In section 4, experiments and results 
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analysis are demonstrated. Section 5 concludes the paper 
and discusses some potential applications of the proposed 
algorithm in the future. 


2. CLUSTERING MODEL 


Evaluation model usually plays the core role in user eval- 
uation framework [12]. In this section, the coupled user 
clustering model is illustrated. This model captures cou- 
pling relationships of user attributes through online behavior 
analysis, and uses spectral clustering algorithm to improve 
clustering accuracy. 
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Figure 1: The coupled user clustering model 


The model is composed of user learning behavior analysis, 
coupled interactions computation of user attributes, inte- 
grated coupling representation and spectral clustering algo- 
rithm, illustrated in figure 1. As the basis, data for user 
learning behavior analysis needs to be collected, consolidat- 
ed and normalized. From the data, user attributes infor- 
mation is extracted. With the extracted user attributes, 
the intra-coupled interaction within an attribute and inter- 
coupled interaction among different attributes are respec- 
tively captured. Then all the interactions are integrated 
and represented using Taylor-like expansion. Finally we use 
a spectral clustering algorithm - NJW to cluster users. This 
model is consequently applied in various Web-based person- 
alized services, like Learning guide customization, tutoring 
and learning resources recommendation. 


3. CLUSTERING ALGORITHM 


Based on the model illustrated in section 2, this paper pro- 
posed an online coupling user clustering algorithm. It is 


176 


Table 2: A fragment example of user attributes 


A 


ay a2 as a4 a5 a 
U 3 6 


U1 0.61 | 0.55 | 0.47 | 0.72 | 0.63 | 0.62 
u2 0.75 | 0.92 | 0.62 | 0.63 | 0.74 | 0.74 
U3 0.88 | 0.66 | 0.71 | 0.74 | 0.85 | 0.87 
UA 0.24 | 0.83 | 0.44 | 0.29 | 0.21 | 0.22 
Us 0.93 | 0.70 | 0.66 | 0.81 | 0.95 | 0.98 


suitable for network education, not only applicable to us- 
er clustering analysis in Web-based learning systems, but 
also to enterprise training, performance review and other- 
s with users participation and behaviors recording. This 
section describes the details of the proposed coupled user 
clustering algorithm. Firstly, it collects user learning be- 
havior information and extracts user attributes from them. 
Secondly, it calculates and represents users’ intra-coupled 
and inter-coupled relationship. Thirdly, the intra-coupled 
and inter-coupled interactions are integrated to be a cou- 
pled representation. Finally, it clusters users based on the 
processed attributes, using NJW algorithm. 


3.1 User learning behavior analysis 

When students login a Web-based learning system, the sys- 
tem will record their activity information, such as number 
of learning resources, total time length of learning resources 
and average correct rate of homework, which can be used 
to build an evaluation index system. We refer to a Web- 
based personalized user evaluation model [13] and utilizes 
its evaluation index system to extract students’ attributes 
information. This index system is with evaluation standards 
of America K-12 (kindergarten through twelfth grade) [14] 
and Delphi method [15], which is a hierarchical structure 
built according to mass of information and data generated 
during general e-learning activities. It defines 20 indicators 
and can comprehensively represent the students’ attributes, 
as shown in table 1. 


Generally attributes are with various data types and units, 
we formalize them by creating the table 2. 


3.2 Intra-coupled and inter-coupled represen- 


tation 

In this section, we represent intra-coupled and inter-coupled 
interactions of user attributes. And with a few examples, 
the application of CUCA is demonstrated. We choose 5 
students and 6 of the 20 attributes in table 1, which are 
“Average correct rate of homework”, “Times of doing home- 
work”, “Number of learning resources”, “Total time length of 
learning resources”, “Daily average quiz result” and “Com- 
prehensive test result”. The 6 attributes are respectively 
signified by ai, a2, a3, a4, a5 and ag¢ in table 2. 


Here we use a tetrad S = (U,A,V, f) to represent user at- 
tributes information. U = {ui,u2,...,WUm} means a finite 
set of users; A = {a1,@2,...,@n} refers to a finite set of 
attributes; V = Ufa V; represents all attributes value sets; 
Vj = {aj-v1,---,@j-Ue,; } is the value set of the j-th attribute; 
f =Uei fi. f7 «U > Vj is the function for calculating a 
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certain attribute value. For example, the information ta- 
ble 2 above contains 5 users {u1, U2, U3, U4, Us} and 6 at- 
tributes {a1, a2, 43, 4,45, a6}; the first attribute value of ui 
is fi(ur) = 0.61. 


The common way to calculate the interactions between 2 
attributes is Pearson’s correlation coefficient [7]. The us- 
er attributes from the Table 1 are continuous variables and 
approximate to Normal distribution, meeting the constrain- 
t condition of the Pearson’s correlation coefficient. Thus 
we use it to help to calculate attributes interactions in this 
paper. For instance, the Pearson’s correlation coefficient be- 
tween a; and a; is formalized as: 


Deucu (Fi (u) — Hy) (Fe(u) — pe) 


J Duew (Fi(&) — #9)? Deu (fe(u) — He)? 
(1) 


Cor(aj;, ax) = 


Where j4j, 4x are respectively mean values of aj, ax. 


The Pearson’s correlation coefficient helps to calculate the 
attributes interactions, but it fits for linear relationship on- 
ly, which is not sufficient to fully capture pairwise attributes 
interactions. Therefore we converts the original data at- 
tributes into a higher dimensional feature space to extract 
more attribute information [16]. 


Firstly, we use a few additional attributes to expand inter- 
action space. Then there are L attributes for each origi- 
nal attribute aj, including itself, namely (a;)', (a;)?, ... , 
(a;)". Each attribute value is the power of the attribute, 
for instance, (a;)* is the third power of attribute a;, (aj)? 
(1 < p < L) is the p-th power of a;. In table 3, the deno- 
tation a; and (a;)' are equivalent; the value of (a;)* is the 
square of that of a;. For simplicity, we set L=2. 


Secondly, the correlation between pairwise attributes is cal- 
culated. It captures both local and global coupling relations. 
We take the p-values for testing the hypotheses of no correla- 
tion between attributes into account. p-value here means the 
probability of getting the maximum correlation observed by 
random chance, while the true correlation is zero. If p-value 
is smaller than 0.05, the correlation Cor(aj,ax) is signifi- 
cant. The updated correlation coefficient is as: 


Cor(aj;,ax) if p-value<0.05, 
0 otherwise. 


R_Cor(aj, ax) = { (2) 


Here we do not consider all relationships, but only takes the 
significant coupling relationships into account, because al- 
1 relationships involvement may cause the over-fitting issue 
on modeling coupling relationship. This issue will go against 
the attribute inherent interaction mechanism. So based on 
the updated correlation, the intra-coupled and inter-coupled 
interaction of attributes is proposed. Intra-coupled interac- 
tion is the relationship between a; and all its powers; inter- 
coupled interaction is the relationship between a; and pow- 
ers of the rest attributes a; (k 4 j). 
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Table 3: Extended user attributes 


A ; ; 

U (a1)* (a1)?| (a2)* (a2)?} (a3)* (as)?| (aa)* (a4)?| (as)* (as)? (ae)* (a6)? 

U1 0.61 0.37 | 0.55 0.30 | 0.47 0.22 | 0.72 0.52 | 0.63 0.40 | 0.62 0.38 

U2 0.75 0.56 | 0.92 0.85 | 0.62 0.38 | 0.63 0.40 | 0.74 0.55 | 0.74 0.55 

U3 0.88 0.77 | 0.66 0.44 | 0.71 0.50 | 0.74 0.56 | 0.85 0.72 | 0.87 0.76 

UA 0.24 0.06 | 0.83 0.69 | 0.44 0.19 | 0.29 0.08 | 0.21 0.04 | 0.22 0.05 

U5 0.93 0.86 | 0.70 0.49 | 0.66 0.44 | 0.81 0.66 | 0.95 0.90 | 0.93 0.86 
Definition 1 Intra-coupled interaction. The intra-coupled ( 0 0 0.898 0.885 0.928 0.921 
interaction within an attribute is represented as a matrix. 0 0 0.929 0.920 0.879 0.888 


For attribute aj, it is an L x L matrix R’(a;). In the 
matrix, (p,q) is the correlation between (a;)? and (a;)? 
(l<p,q< JL). 


ain(j)  a1e(J) ait(j) 

ie a2i(j) a22(7) +++ ae2z(J) 
BR Mae: eee o 28 . eee © 

ari(j) ara(J) arx(j) 


Where Qpq(j) = R_Cor((a;)?, (a;)"%) is the Pearson’s corre- 
lation coefficient between (a;)? and (a;)?. 


For attribute a; in table 3 above, we can get the intra- 

1 0.986 

7 y a: . Ia _ 

coupled interaction of it as R’“(a1) = 0.986 1 ) 

which means that the correlation coefficient between at- 

tribute “Average correct rate of homework” and its second 

power is as high as 0.986. There is close relationship between 
them. 


Definition 2 Inter-coupled interaction. The inter-coupled 
interaction between attribute a; and other attributes a, 
(k # j) is quantified as an L x L * (n — 1) matrix as: 


R’(aj\{ax}ne;) = ( RT (ajlan,) R'*(ajlax,_1) ) 


(4) 


Bir(J|ki)  Br2(9] ki) Bir (3k) 


Rl (aj\an,) = as _ oe 
Bri(jlki) Bro(g|ki) Bii(ilki) 
(5) 


Here {ax}x4, refers to all the attributes except for a;, and 
Boq(g|ki) = R-Cor((a;)”, (ax,)%) is the correlation coeffi- 
cient between (a;)? and (ax,)? (1 < p,q < L). 


For attribute a; in the table 3 above, the inter-coupled inter- 
action between ai and others (a2, a3, a4, a5, a6) is calculated 
as: 

R'*(a1|{a2, a3, a4, a5, 06}) = 
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0.997 0.982 0.999 0.988 
0.978 0.994 0.982 0.999 


The p-values between a; and others (a2, a3, a4, a5, a6) is cal- 
culated as: 


"(ai |{a2,a3,@4,45,a6}) = 


P 
0.689 0.677 0.039 0.046 0.023 0.027 
0.733 0.707 0.023 0.027 0.050 0.044 


0 0.003 0 0.002 
0.004 0.001 0.003 0 


Based on the result, we can find that there is hidden cor- 
relation between user attributes. For instance, all the p- 
values between attribute a; and az are larger than 0.05, so 
the correlation coefficient is 0 based on Equation (2), indi- 
cating there is no significant correlation between “Average 
correct rate of homework” and “Times of doing homework”. 
Meanwhile, the correlation coefficient between a; and as, a1 
and ag is quite close to 1; it indicates “Daily average quiz 
result” and “Comprehensive test result” respectively have 
close relationship with “Average correct rate of homework”, 
which is consistent with our practical experiences. In conclu- 
sion, comprehensively taking into account intra-coupled and 
inter-coupled correlation of attributes can efficiently help 
capturing coupling relationships between user attributes. 


3.3 Integrated coupling representation 
Intra-coupled and inter-coupled interactions are integrated 
in this section as a coupled representation scheme. 


In table 3 above, each user is signified by L*n updated vari- 
ables A = {(a1)1,...,(a1)”,...,(an)1,...;(an)"}. With 
the updated function bes (w), the corresponding value of at- 
tribute (an)? is assigned to user u. Attribute a; and all its 
powers are signified as ti(a;) = if (it) ssh ie (uw)], while the 
rest attributes and all powers are presented in another vec- 


tor U({ax}exs) = [fay (U),-- +5 fi, (4), -- 5 fina (Woes 
fE,_,(u)]. For instance, in table 3, wi(a1) = [0.61, 0.37], 
ui({a2, a3, a4, a5, a6}) = (0.55, 0.30, 0.47, 0.22, 0.72, 0.52, 
0.63, 0.40, 0.62, 0.38]. 


Definition 3 Coupled representation. Attribute a;’s cou- 
pled representation is formalized as a 1x L vector u°(a;|A, L), 
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where (1, p) component corresponds to the updated attribute 


(a;)”. 


u®(aj|A, L) = u'*(aj|A, L) + u"*(ajA, L) (6) 
u!"(a;|A, L) = U(a;) Ow @ [R™(a;)]" (7) 


ul*(aj|A, L) = U({an}nzz) © [w, w,.--, w] 


8 
@[R**(aj\{ax}exs)]” 2 


where w = [1/(1!),1/(2!),...,1/(L!)] is a constant 1 x LZ 
vector, [w,w,...,w] isa 1x L*(n—1) vector concatenated by 
n—1 constant vectors w. © denotes the Hadamard product, 
and ® represents the matrix multiplication. 


Take an example in table 4, the coupled representation for 
attribute a1 is presented as u{(a:|A, 2) = [3.85, 3.80]. The 
reason we choose such a representation method is explained 
below. If the above Equation (6) is expanded, for exam- 
ple, we get the (1,p) element which corresponds to (a;)? of 
the vector u°(a,|A, L) as below, which resembles Taylor-like 
expansion of functions [17]. 


1° (a3|4,L) (a5)? = apr(4)- +y * Bel) 2 Gy 


} ee > Bp2 GI) 7 fr, (u) Box 


p Sees aad (iki) Fr fe (u ) 
(9) 


Finally we obtained the global coupled representation of all 
the n original attributes as a concatenated vector: 


u°(A, L) = [u°(a1|A, L), u°(a2|A, L),...,u°(an|A, L)| (10) 


With the couplings of attributes, each user is represented as 
a 1x Lx«n vector. When all the users follow the steps above, 
we then obtain an m x L «n coupled information table. For 
example, based on table 2, the coupled information table 
shown in table 4, is the new representation. 


3.4 User clustering 

We obtained the global coupled representation in table 4. 
Compared with the original representation, this one reflect- 
s coupling interactions of attributes, and contains far more 
coupling relationships. With these data, we can do user clus- 
tering using NJW [18], which is a kind of spectral clustering 
algorithm. Detailed clustering results are demonstrated in 
experiment later. 
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4. EXPERIMENTS AND EVALUATION 


In this section, we conduct experiments to verify the validity 
and accuracy of the proposed algorithm. The data for the 
experiments are collected from a Web-based learning system 
of China Educational Television (CETV), named “New Me- 
dia Learning Resource Platform for National Education”’. 
As a basic platform for national lifelong education, which 
started the earliest in China, and had the largest group of 
users and provided most extensive learning resources, it met 
the needs of personalization and user diversity through inte- 
grating a variety of multi-network, terminals and resources. 
So far, the number of registered users has reached more than 
two million. The experiment is composed of 3 parts: user 
study, user clustering and result analysis. 


4.1 User study 


In the experiment, we ask 220 users (signified by s1, 52,..., 
$220) to learn C programming language online. The whole 
learning process, including recording and analyzing learning 
activities information, is accomplished in CETV. 


The public data sets regarding learners’ learning behaviors 
in online learning systems are insufficient, and most of them 
don’t contain labeled user clustering information. Mean- 
while, because learners always behave with certain subjec- 
tivity in online learning systems, to label learners with dif- 
ferent classifiers based on their learning behaviors only, but 
without the information behind, is not accurate. There- 
fore, we adopt a few user study methods, including self- 
assessment, peer-assessment and teacher-assessment [19], to 
label online learners with classifiers. It is the basis for veri- 
fying the accuracy of clustering. 


Analyzing the 20 attributes extracted from table 1 using 
user evaluation index system proposed in this paper, we 
can easily find that they can be mainly divided into 2 cate- 
gories. Some attributes belong to the category of “learning 
attitude”, which refers to students’ learning initiatives, like 
“Times of doing homework”, “Number of learning resources” 
and “Total time length of learning resources”. While the rest 
belong to the category of “learning effect”, which refers to 
how well students receive knowledge, like “Average correct 
rate of homework”, “Daily average quiz result” and “Com- 
prehensive test result”. Accordingly, we can label learners 
with these attributes from both categories. Each of the at- 
tributes has 3 grades - high, medium and low. Consequently 
every learner has 2 labels and each label has one grade of 
high, medium and low. In total, there will be 9 different 
combinations - high & high, high & medium, high & low, 
medium & high, medium & medium, medium & low, low & 
high, low & medium and low & low. 


After the students had finished a learning phase, we asked 
the 220 users to do a self-assessment using centesimal grade, 
respectively from perspectives of learning attitude and learn- 
ing effect. Then we requested teacher assessments in the 
same way, meaning the teacher of the subject to review the 
students’ performance. Finally, the students were asked to 
do peer-assessments, which means students do an assess- 
ment for each other. Each student will get the assessment 
scores from the rest 219 students. We calculate the aver- 
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Table 4: Integrated coupling representation of user attributes 


U . (a1)* (a1)?} (a2)* (a2)?| (as)* (a3)”] (aa)* (a4)?] (as)? (as)?} (ae)* (a6)? 
U1 3.85 3.80 | 0.70 0.70 | 2.20 1.46 | 3.24 3.23 | 3.35 3.70 | 3.76 3.81 
ue 4.54 4.50 | 1.34 1.34 | 2.89 1.98 | 3.66 3.65 | 3.82 4.31 | 4.37 4.51 
us 5.51 5.46 | 0.88 0.88 | 3.54 2.44 | 4.46 4.45 | 4.66 5.22 | 5.28 5.47 
ua 1.53 1.52 | 1.17 1.17] 1.01 0.80 | 1.03 1.02 | 1.06 1.42 | 1.44 1.52 
U5 5.94 5.89 | 0.94 0.94 | 3.73 2.49 | 4.95 4.94 | 5.17 5.68 | 5.75 5.90 


Table 5: Transformation rule between score and 
grade 
Score range Grade Sample 
80 < X < 100 high 95 
50 < X < 80 medium 75 
0< xX < 50 low 40 


Table 6: The evaluation results of s1 


learning learning 
attitude effect 
Self-assessment 
(40%) 80.0 75.0 
Teacher-assessment 
(35%) 85.0 80.0 
Peer-assessment 
2: 2 
(25%) 82.7 79 
Comprehensive 82.4 77.8 
evaluation results 
grade high medium 
Class high & medium 


age of the 219 scores. A student’s final score is obtained by 
integrating the 3 assessments above. According to Expert 
Investigation Weight Method [15], we did statistical analy- 
sis and got approximate weights for the assessments, name- 
ly 40% for self-assessment, 35% for teacher-assessment and 
25% for peer-assessment. Each student’s final score will be 
transformed into a grade value, “high”, “medium” or “low”. 
The transformation rule between score and grade is shown 
in table 5. 


Take student s; as an example, his 3 assessment scores and 
transformed grades are shown in table 6. 


4.2 User clustering 

In Equation (9), the proposed coupled representation is strong- 
ly dependent on how large L can be. Thus we conduct a few 
experiments to study how the performance of L influences 
the clustering accuracy of CUCA. The range of L value is 
from L = 1 to L = 10. With the growth of L value, L! 
value grows. When L = 10, it is large enough to capture 
most of the information in Equation (9). The experiments 
show that with the growth of L, the clustering accuracy will 
be gradually improved. When L = 3, the accuracy change 
reaches a comparatively stable status; when DL > 3, the ac- 
curacy change is extremely small. That means the accuracy 
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of when L = 3 and when L = 10 is quite similar. To guar- 
antee the accuracy of experimental results and reduce the 
complexity of the algorithm, we take L = 3 in the following 
comparative experiments. 


In the experiments, we utilize the attributes data gener- 
ated from the 220 students’ learning process, as the basis 
for clustering. Then we persistently collect data from the 
process which reaches 30 hours by average. Respectively 
with the help of K-means algorithm, Fuzzy C-means algo- 
rithm(FCM), NJW algorithm and CUCA algorithm, we do 
user clustering, getting 2 labels in terms of learning attitude 
and learning effect for each student. In section 4.1, we clas- 
sified each student with 2 labels based on user study result. 
Then we compare the labels got from user study and us- 
er clustering result. If only one label from each side is the 
same, the clustering accuracy rate is 50%; if both the labels 
are the same, the accuracy rate reaches 100%. For instance, 
student s1 is labeled with “high & medium” in user study, if 
he is classified to “medium & medium” cluster, the clustering 
accuracy rate is 50%; if he is classified to “high & medium” 
cluster, the accuracy rate reaches 100%. 


100.00% 


90.00% - 


80.00% -~ 


70.00% ~ 


60.00% 7— mj K-means 


50.00% -~ mFCM 


40.00% + NW 


Accuracy 


mCUCA 
30.00% ~— 


20.00% -~ 


10.00% - 


0.00% ~—*' T T —1 
Learning Attitude Learning Effect Integrated Dimension 


Figure 2: Clustering result analysis (30h) 


4.3 Result analysis 

We do comparison analysis on the clustering result respec- 
tively from the 3 dimensions of learning attitude, learning 
effect and the integrated dimension. The analysis result is 
shown in figure 2. We can see the clustering accuracy of uti- 
lizing CUCA is 89.4% for learning attitude, 87.3% for learn- 
ing effect and 74.6% for integrated dimension, each of which 
is higher than that with the other 3 algorithms. Especially, 
CUCA obviously outperforms the rest on clustering accu- 
racy of integrated dimensions. Compared with K-means, 
which performs the worst, CUCA improves almost 30% on 
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the clustering accuracy. The reason is CUCA fully takes 
into account coupling relationships of users. In Web-based 
learning systems, if the user attributes are more complicat- 
ed, there will be more clustering dimensions and the clus- 
tering accuracy will be improved more. 


—@K-means =F FCM —it—NIW -—<—CUCA 


100.00% 
90.00% 
80.00% 
70.00% 
60.00% 
50.00% 
40.00% 
30.00% 
20.00% 
10.00% 

0.00% T T \ T T 1 


Accuracy 


(a) Learning Attitude 


—@—K-means =—HFCM =te=NJW =<CUCA 


100.00% 
90.00% 
80.00% 
70.00% 
60.00% - 
50.00% 
40.00% 
30.00% 
20.00% 
10.00% 

0.00% T T T T T 1 


Accuracy 


(b) Learning Effect 


—@=K-means =H FCM =t=NJW =<CUCA 


80.00% 


70.00% 


60.00% 
50.00% 
40.00% = 


30.00% 


Accuracy 


20.00% 


10.00% 


0.00% T T T T , 1 


(c) Integrated Dimension 


Figure 3: Clustering result of different time phases 


If we divide the process of extracting user attributes to 6 
phases, namely 5h, 10h, 15h, 20h, 25h, 30h based on aver- 
age learning length, we can get the correlation between av- 
erage learning length and clustering accuracy, as shown in 
figure 3. From the figure, we can see that while the learning 
length grows, the clustering accuracy of the 4 algorithms 
keeps improving, specifically for CUCA. With CUCA, the 
clustering accuracy on integrated dimensions distinctly out- 
performs that of the 3 other algorithms. It indicates that 
with the increasing learning behavior data volume, CUCA 
can find the hidden coupling relationships of user attributes 
more easily, and the clustering accuracy is much better. 
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Besides, we can verify clustering accuracy through analyzing 
user clustering results. The best performance of a clustering 
algorithm is keeping the distance within clusters as small as 
possible and the distance between clusters as large as possi- 
ble. We use the evaluation criteria of Relative Distance (the 
ratio of average inter-cluster distance upon average intra- 
cluster distance) and Sum Distance (the sum of object dis- 
tances within all the clusters) to present the distance. The 
larger Relative Distance is and the smaller Sum Distance is, 
the better clustering results are. From figure 4, we can see 
that the Relative Distance for CUCA is larger than that of 
the 3 other algorithms, while the Sum Distance for CUCA 
is smaller. It indicates that CUCA outperforms the rest in 
terms of clustering structure. 
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Figure 4: Clustering structure analysis (30h) 


5. CONCLUSION 

A coupled user clustering algorithm (CUCA) for Web-based 
learning systems is proposed in this paper to capture cou- 
pling relationships of user attributes. The algorithm respec- 
tively takes intra-coupled and inter-coupled correlation into 
account in the application process, and utilizes Taylor-like 
expansion to represent the coupling relationship. Finally, 
with the usage of spectral clustering algorithm, CUCA is 
applied to do user clustering. In the experiments, user s- 
tudy, user clustering and result analysis are adopted to ver- 
ify that CUCA outperforms traditional algorithm for user 
clustering. 


In this paper, the user attributes extracted from user learn- 
ing behavior data are all numerical data, most of which are 
continuous data. In reality, there are also categorical data, 
which will be a significant study topic in the future. 
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